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Abstract. We explore a simple mathematical model of network computation, based on Markov chains. 
Similar models apply to a broad range of computational phenomena, arising in networks of computers, 
as well as in genetic, and neural nets, in social networks, and so on. The main problem of interaction 
with such spontaneously evolving computational systems is that the data are not uniformly structured. 
An interesting approach is to try to extract the semantical content of the data from their distribution 
among the nodes. A concept is then identified by finding the community of nodes that share it. The 
task of data structuring is thus reduced to the task of finding the network communities, as groups of 
nodes that together perform some non-local data processing. Towards this goal, we extend the ranking 
methods from nodes to paths, which allows us to extract information about the likely flow biases from 
the available static information about the network. 

1 Introduction 

Initially, Web search was developed as an instance of information retrieval, optimized for a particularly large 
distributed database. With the advent of online advertising, Web search got enhanced by a broad range of 
information supply techniques where the search results are expanded by additional data, extrapolated from 
user's interests, and from search engine's stock of information. From the simple idea to match and coordinate 
the push and the pull of information on the Web as a new computational platform [18| sprang up a new 
generation of web businesses and social networks. Similar patterns of information processing are found in 
many other evolutionary systems, from gene regulation, protein interaction and neural nets, through the 
various networks of computers and devices, to the complex social and market structures [TS] . 

This paper explores some simple mathematical consequences of the observation that the Web, and similar 
networks, are much more than mere information repositories: besides storing, and retrieving, and supplying 
information, they also generate, and process information. We pursue the idea that the Web can be modeled as 
a computer, rather than a database; or more precisely, as a vast multi-party computation [B], akin to a market 
place, where masses of selfish agents jointly evaluate and generate public information, driven by their private 
utilities. While this view raises interesting new problems across the whole gamut of Computer Science, the 
most effective solutions, so far, of the problem of semantical interactions with the Web computations were 
obtained by rediscovering and adopting the ranking methods, deeply rooted in the sociometric tradition 
[llllOj . and adapting them for use on very large indices, leading to the whole new paradigm of search 
[19112113] . Implicitly, the idea of the Web as a computer is tacitly present already in this paradigm, in 
the sense that the search rankings are extracted from the link structure, and other intrinsic information, 
generated on the Web itself, rather than stored in it. 

Outline of the paper. In section [2] we introduce the basic network model, and describe a first attempt to 
extract information about the flows through a network from the available static data about it. In sections [3] 
and[31 we describe the structure which allows us to lift the notion of rank, described in section [5l to path 
networks in section [51 Ranking paths allows us to extract a random variable, called attraction bias, which 
allows measuring the mutual information of the distributions of the inputs and the outputs of the network 
computation, which can be viewed as an indicator of non-local information processing that takes place in 
the given network. In the final section, we describe how the obtained data can be used to detect semantical 
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structures in a network. The experimental work necessary to test the practical effectiveness of the approach 
is left for future work. 

2 Networks 

Basic model. We view a network as an edge-labelled directed graph A = [R — E i N), where N 

e 

and E are, respectively, the finite sets of nodes, and links, or edges, whereas R is an ordered field of values 
(in some applications an ordered ring of functions). A link i — > j is thus an element e G E with S{e) — i and 
g{e) = j. The value 7(e) is the cost (when positive), or payoff (when negative) of the traffic over e. These 
data induce the adjacency matrix E — {Eij)NxN and the capacity matrix A — {Aij)NxN, with the entries 
Eij = {e £ E I i j} and Aij = X^eeB ^e, where A^ — 2^^'^'^^ is the capacity of the link e. 

Remark. The term " capacity" is used here as in network flow theory^ The cost or the payoff of a link may 
represent its value in a pay-per-click model of a fragment of the Web; or it may denote the proximity of the 
web pages within the same site, or within a group of interconnected sites. In a protein network, the energy 
cost or payoff may be derived from the chemical affinities between the nodes. While this parameter can be 
abstracted away, simply by taking 7(e) = for all links e, its role will become clear in sections |3] and [6l 
where it allows discounting and eliminating some paths. 

Basic dynamics. The simplest model of network dynamics is based on the assumption that the traffic flows 
are distributed proportionally to the link capacities. Randomly sampling the Web traffic, we shall thus find 
a surfer on a link e with the probability ae = where A, = X/eB ^f- 

In order to find the communities in a network, we need to detect the traffic biases between its nodes. We 
assume that the traffic between the nodes within the same community will be higher than the capacity of the 
links between them would lead us to expect; and that the traffic between the different communities will be 
lower than expected. To measure such traffic biases, we normalize the capacity matrix A to get the capacity 
distribution a = {aij)NxN as aij = -j^, where A,, = J2k ieN ^kt- The entry Uij is thus the probability 
that a random sample of traffic on A, following the simple dynamics proportional to capacity, will be found 
on a link from i to j. On the other hand, the marginals of the probability distribution a, 

= '^aij a,j = '^Oi.j 

correspond, respectively, to the probabilities that a random sample of traffic will have i as its source, and j 
as its destination. Let us call ai, the out-rank of i, and a,j the in-rank of j, because they can be viewed as 
the simplest, albeit degenerate cases of the notion of rank. 

If the in-rank and the out-rank are statistically independent, then (by the definition of independence) the 
probability that a random traffic sample goes from i to j will be a^.a.j. Their dependency is thus measured 
by the trajfic bias matrix v = {vij)j^xN with the entries Vij — aij — aita^j falling in the interval [—1, 1]. 
The higher the bias, the more unexpected traffic there is. For a set of nodes U C N the values 

Coh{U) = ^ Adh([/) = ^ Vij -f Vji 

iJeU i£U,j^U 

can thus be construed as the cohesion and the adhesion forces: the total traffic bias within the group, and 
with its exterior, respectively. A network community U can thus be recognized as a set of nodes with a high 
cohesion and a low adhesion |16j . The idea that semantically related nodes can be captured as members 

^ The information theoretic homonym has a different, albeit related meaning, which motivates the choice of 7(e) = 
-log^Ae. 
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of the same network communities, derived from the graph structure, is a natural extension of the ranking 
approach, which has been formahzed in f9'17]. 

The only problem with applying that idea to the above model is that our initial assumption — that the 
traffic distribution on A is proportional to its link capacities — is not very realistic. It abstracts away all 
traffic dynamics. On the other hand, the static network model, as given above, does not provide any data 
about the actual traffic. We explore the ways to solve this problem, and extract increasingly more realistic 
views of traffic dynamics from a static network model. 

3 Adding paths 

A path i A j in a network ^ is a sequence of links « — + fci ^ fc2 ^ ■ • ■ ^ In many cases of interest, traffic 
dynamics on a network depends on the path selections, rather than just on single links. 

One idea is to add the paths to the structure of a network, and to annotate how the links compose into 
paths, and how the paths compose into longer paths. This amounts to generating the free category [14 over 
the network graph. Unfortunately, adding all paths to a network usually destroys some essential information, 
just like the transitive closure of a relation does. E.g., in a social network, a friend of a friend is often not 
even an acquaintance. Taking the transitive closure of the friendship relation obliterates that fact. Moreover, 
the popular "small world" phenomenon suggests that almost every two people can be related through no 
more than six friends of friends of friends. . . So already adding all paths of length six to a social network, 
with a symmetric friendship relation, is likely to generate a complete graph. In fact, the average probability 
that two of node's neighbors in an undirected graph are also linked with each other is an important factor, 
called clustering coefficient 2()\. On the other hand, in some networks, e.g. of protein interactions, a link 
i k which shortcuts the links i —> j k often denotes a direct feed-forward connection, rather than a 
composition of the two links, and leads to essentially different dynamics. 

So only "short" paths must be added to a network: composition must be penalized. 

Definition 1. For a given network A — [R — E t N), a cutoff value v E R, and a composition 

Q 

penalty d E R, we define the w-completion to be the network A*"" = (R -« — E*"" t N) , where 

s 

E*'" = {ae E* \ 7(a) < v} and 

7(^0 ^ ii ^ i2 ^ ■ ■ ■ ^ in) ^ (n - l)d + 7(ai) H h 7(a") 

and E* is the set of all nonempty paths in A. 

Remarks. E* can be obtained as the matrix of sets E* = X^^o where each i?" is a power of the adjacency 
matrix E. If the entry Eij is viewed as the set of links {i j}, then the entry Efj = J2k=i ^-ik ■ Ekj 
corresponds to the set of 2-hop paths {i ^ k ^ j} through the various nodes k; the matrix E^ similarly 
corresponds to the matrix of 3-hop paths, and so on. 

The -y-closed network A*^ is closed under the composition of low cost paths, but not if the cost is greater 
than V. It is not hard to see that the w-completion is an idempotent operation, i.e. A™*"" = A*^ , but it may 
fail to be a proper closure operation, because a link e in A, with 7(e) > v, may lead to A% A*'" . 

In the rest of the paper, we assume that the networks are v- complete for some v, i.e. A = A*^ . This 
means that the relevant pathways are already represented as links, with the composition penalty absorbed 
in the cost. 

4 Network dynamics 

In order to derive network dynamics from a static network model, one first specifies the way in which the 
behavior of a computational agent, processing data on the network, is influenced by the network structure, 
and then usually derives a Markov chain that drives the traffic. The network features that influence its 
dynamics can then be incrementally refined, yielding more and more information. 
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4.1 Forward and backward 

Random walks on networks are often represented in terms of the behavior of surfers on the Web, following 
the hyperlinks0 The simplest surfer behavior chooses an out-link uniformly at random at each node. A 
visitor of a node i will thus proceed to a node j with probability A^j — where Ai, = X^feLi -^ik ^^e 
out-degree of i. The row-stochastic matrix = iAfj)NxN represents forward dynamics of a network A. 
The entries A^j are called the pull coefficients of i by j. 

Dually, backward dynamics of a network A is represented by a column-stochastic matrix A*' = (A^ )ArxAf, 

where the entry Afj — -j^, with A^j = X^a^i ^kj denoting the in-degree of j, describes the probability that 
a surfer who is on the node j came there from the node i. The entries Afj are called the push coefficients. 

Remark. Note that the capacity matrix can be normalized to get A'^ and A"^ as above only if no rows, resp. 
columns, consist of Os alone. This means that every node of the network must have at least one out-link, 
resp. at least one in-link. Networks that do not satisfy this requirement need to be modified, in one way or 
another, in order to enable analysis. Adding a high-cost link between every two nodes is clearly the minimal 
perturbation (with maximal entropy) that achieves this. Alternatively, the problem can also be resolved by 
adjoining a fresh node, and the high-cost links in and out of it [2]. Either way, the quantitative effect of such 
modifications can be made arbitrarily small by increasing the cost of the added links. 



4.2 Forward-out and backward-in dynamics 

The next example can be interpreted in two ways, either to show how forward and backward dynamics can 
be refined to take into account various navigation capabilities, or how to abstract away irrelevant cycles. 
Suppose that a surfer searches for the hubs on the network: he prefers to follow the hyperlinks that lead to 
the nodes with a higher out-degree. This preference may be realized by annotating the hyperlinks according 
to the out-rank of their target nodes. Alternatively, the surfer may explore the hyperlinks ahead, and select 
those with the highest out-degree; but we want to ignore the exploration part, and simply assume that he 
proceeds according to the out-rank of the nodes ahead. The probability that this surfer will move from i to 
j is thus 

A - - A - 

-rl-tm-rl-m* 

We call this the forward-out dynamics. In the dual, backward-in dynamics, the surfers are more likely to 
arrive to j from i if this is a frequently visited node, i.e. if its in-rank is higher 

Aii — a,i ■ Aj 



A„A,j 



These dynamics will be the particularly convenient to demonstrate an example of bias analysis in section [6l 
because they clearly display clearly how the simple traffic bias v from section [2] can be refined by the various 
dynamics factors. 



4.3 Teleportation and preference 

The main point of formulating network dynamics, especially in the Markov chain form, is to be able to 
compute the node ranking as its invariant distribution. However, since the network graphs are usually not 
strongly connected, the Markov chains, derived from their structure, are often reducible to classes of nodes 
with no way out. 

The simplest remedy is the idea of teleportation, going back to |19| . A general interpretation is that, 
whichever dynamics a surfer might follow, at each node he tosses a biased coin, and with a probability 
d g (0, 1) follows that dynamics. Otherwise, with a probability 1 — c?, he "teleports" to a randomly chosen 

^ The surfers deserve their name by following the "waves", i.e. obeying the same dynamics. 
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node, ignoring all hyperlinks and other structure. Following, say, forward dynamics, the probability that he 
will go from i to j is thus Afj = dA^j + ^^- This is roughly the PageRank dynamics, from which the Google 
search engine had started 19jf|. The induced dynamics is thus = dA'^ + (1 — d)P, where P ~ {Pij)NxN 
has all entries Pij = . In the networks without a cost function, this is interpreted as adding a link between 
every two nodes. The influence of such links can be controlled using the cost functions. In any case, the 
resulting Markov chains become irreducible, and their stationary distributions do not get captured in any 
closed components. Furthermore, the model can be personalized by capturing surfer's preferences in terms 
of the biases in P: the entries Pij can be interpreted as i's trust in j [5]. The extensions of the backward 
dynamics by teleportation yields to different interpretations, which the reader may wish to consider on her 
own. 



5 Ranking 

Intuitively, the rank of a node is the probability that randomly sampled traffic will be found to visit that 
node. In search, this is taken as a generic relevance measure. The technical implication is that the rank can 
be obtained as a stationary distribution of the Markov chain capturing dynamics. Each notion of dynamics 
thus induces a corresponding notion of rank. Since a Markov chain can be viewed as a linear, and hence 
continuous transformation of the simplex of distributions, which is closed and compact, already Brouwer's 
fixed point theorem guarantees that the rank always exists. Finding a meaningful, useful notion of rank is 
another matter. 

First of all, as already mentioned, networks often decompose into loosely connected subnets. In the long 
run, all traffic is likely to get captured in some such subnet. This results in multiple stationary distributions, 
each concentrated in a closed subnet, zero otherwise. Dynamics derived directly from the network graph 
therefore result in uninformative ranking data. In order to assure that the relevant Markov chains are irre- 
ducible and aperiodic, and thus induce unique and nondegenerate stationary distributions, network dynamics 
usually need to be perturbed, using a damping and stabilizing factor such as teleportation. Another sort of 
problems arise when the unique stationary distribution is not an attractor, or when the rate of convergence 
is unfeasibly slow |7l3j . 

While very important in concrete applications, these problems, and their solutions, have less impact 
on the conceptual analyses pursued in this paper. We shall henceforth assume that all processes have been 
adjusted to induce unique and effectively computable ranking^ 



5.1 Promotion and reputation 

We now explain the intuition behind the simplest notions of rank. 

In social terms, the push coefficient Af^ = can be interpreted as measuring how much i supports (or 
advocates) j. The concept of promotion can then be formalized as a probability distribution r'^, such that 
rf = X)fc=i ^ifc^fe ■ words, the promotion rank (or push rank) rf of a node i is the sum of the promotion 
ranks of its children nodes, each allocated to i according to the push coefficient A^,, measuring i's support 
for k. 

Dually, the pull coefficient A^- can be interpreted as measuring how much i trusts j. The concept of 

reputation can then be formalized as a probability distribution r"*, such that — X^fcLi'^fe^fcj- This 
reputation rank (or pull rank) rf of a node i is thus the sum of the reputation ranks rj! of its parent nodes, 
each allocated according to the pull coefficient A'j^j, of fc's trust in j. 

^ The original version allowed to be 0, if Ai, is 0, i.e. if i is a "sink-hole", and the teleportation factor was added 

to save dynamics from such sinkholes. Other modifications were introduced later. 
* This implies that all notions of dynamics that we consider have a tacit damping factor. We do not display it only 

because it needlessly complicates formulas. 
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Gathering the promotion values in a column vector r"^ and the reputation values in a row vector , we 
can rewrite the definitions of r*' and in the matrix form 

r" = A-'r" = r" A" 

The refined notions of promotion r" and reputation r" are defined and interpreted along the same lines, as 
the stationary distributions of the processes A" and A" respectively. 

5.2 Expected flow 

While dynamics of reputation has been studied for a long time [llllOj , and with increased attention recently, 
since it become a crucial tool of Web search |19ll3j , the dual dynamics of promotion does not seem to have 
attracted much attention. We need both notions to define the expected traffic flow. 

The expected flow from j to k, under the assumption that they arc independent, is caused only by a 
"traffic pressure", resulting from the pull to j and the push from k. Following this idea, we define 

r^k = ^j^k (1) 

The expected flow r"^ is thus a probability distribution over N x N, which can be represented as the matrix 
j.-> — J.- . J.'- ^ obtained by multiplying the column vector r" and the row vector r"". Since r" and r"" are 
the principal ei^nvectors of A" and A*" , r" is the unique distribution satisfying r"^ ~ A" ■ r"" ■ A", i.e. 

= J2i=i Sf=i ^ij''^ie^k£ - Intuitively, this means that the flow pressure from i to i propagates to cause a 
flow pressure from j to k proportionally to the force of the traffic from i to j and to the force of traffic fiows 
from k to £ — provided that j and k are independent. In order to measure their dependency, we attempt to 
capture how the actual flows from i to £ (rather than mere flow pressure) may get diverted, say by the high 
costs and the low capacities, to cause actual flows from j to k. 

6 Path networks 

Definition 2. Given a v-closed network A ~ [R ^ — E I N) , we define the path network 
A ^ {R E =lt N), where N = E, and E = T,a,beE^o^b, with 

Eab = {/ = (/o, /l) € Eij X Ekl 

7(/) = 2d + 7(/o)+7(&) + 7(./i) 

i 



a 



£ 



l{h)+l{h)+i{h)-i{a)<v^2d} 
-7(a) 



(2) 
(3) 



Dynamics of path selection. Recalling that Aab — j, ^ '^^^\we define the forward and the backward 

dynamics, and the pull rank and the push rank just like before: 

^ab — '2 — ^ab ~ 

Aam Afb 

^b = ^ faA^b r^ = ^ ^ab^ 

a£N beJV 
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Intuitively, v4^^ is now the probability that traffic through a is diverted to b (rather than to some other path); 
while A'^f^ is the probability that traffic through b is diverted from a (and not from some other path). The 
pull rank fj,, i.e. the probability that b will be traversed, can thus be understood as its attraction; whereas 
is the probability that a will be avoided. 

Using the pull rank of the paths, we can now define the node attraction between j and k to be the total 
attraction of all paths between them: 

rjk = ^n (4) 

The idea is that this notion of attraction the nodes will allow us to refine the estimate of the traffic bias v 
as described in section [2] In particular, consider attraction bias 

^jk = rjk - r'^k (5) 
To motivate this, note that expanding the formula for in section 15.21 shows that r"^ is the stationary 
distribution of the Markov chain A"^ — ( A"^..,, ] , where 

4- _ AijAj,A,kAke r-- - V 4-' r'"' 

l,l£N 

On the other hand, the node attraction r turns out to be a stationary distribution of a process that refines 
A'-. 



Definition 3. Given a network A, its attraction dynamics is a Markov chain A = ^^(y")(fef)^ 



, with 



the entries 

2 Aj-jAjkAM 

H3)m = A. A A ^ (6) 



where Ai,A„A,( — X/m.nsAf 



ni- 



Proposition 1. Suppose that a given network A is v-complete for a sufficiently large v. Then the node 
attraction r, defined in is the stationary distribution of its attraction dynamics In other words, for 
every j, k holds 



^jk = ^ A(ij),^M)rii (7) 



The proof is in the Appendix. It is based on the following lemma. 

mma 1. For a network A, 
equations hold for i £ and j 



Lemma 1. For a network A, which is v- complete for a sufficiently large cutoff value v, the following 



Aab = -^^A,,Au (8) 
= -^^A.jAjuAm (9) 

A'^Aa 

On the other hand, proposition [T] implies the following corollary, which establishes that formula ([5]) can 
be used to measure the attraction bias, as intended. 
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Corollary 1. The directed reputation and promotion ranks are the marginals of the node attraction 

keN 

Interpretation. To understand the meaning of attraction bias, consider a ■y-complete network A, with the 
forward-out and backward-in dynamics. The pull rank tells how likely it is that a randomly sampled 
traffic path arrives to j; whereas the push rank tells how likely it is that a randomly sampled traffic path 
departs from k. 

On the other hand, the attraction dynamics in the induced path network A gives the node attraction rjk, 
which tells how likely it is that a randomly sampled traffic path traverses a path from j to k. In summary, 
we have 

= Prob(« ^ j \ A^) 

r^ = Prob(fc ^ • I ^ e A") 
fjfe = Prob(j ^ fc I e e A) 

Although the notation suggests that r" ^r" , and r are sampled from different processes, corollary[T]establishes 
that r is in fact the joint distribution of r" and r" . 

Nevertheless, a diligent reader will surely notice a twist of j and k in the last three equations, and wonder 
why is the probability that traffic goes from j to k related with the probabilities that it arrives to j, and that 
it departs Jrom kl — The answer to this question makes the forward- o«< and the backward- m dynamics 
into a more interesting example than its many dynamical cousins. Briefly, if the surfers are more likely to 
flow with if the capacity of the links out of j is higher, and if they are more likely to flow with fc^» if 
the capacity of the links into k is higher, then the surfers are most likely to follow both these flows, i.e. into 
j and out of fc — if there is a high capacity of the links j — > k. 

Mutual information of the inputs and the outputs. The fact that r is the joint distribution of the 
processes expressed by r" and r" allows us to extract their mutual information [3] 

N N ^ 

I{r--r-) = D{r\\r-) = ^^f^, log^ 

j = l k=l ''^3 

Its expression in terms of relative entropy || r") [ibidem] shows that it measures how much we lose, 
in the efficiency of encoding of r if we assume that r" and r" are mutually independent. Intuitively, the 
mutual information /(r" ; r") can thus be taken as a measure of the locality of information processing in A. 
If this is an entirely local process, then every path must begin and end at the same node, and the random 
walks S and g, selecting the sources and the destinations of the paths, must coincide. But ii 5 = g, then the 
push rank and the pull rank must obey the same distribution r" — r" — r, and their mutual information is 
/(r" ; r") — H{r), their entropy. In the other extreme case, the random walks S and g are independenlH, 
and their joint distribution is just the product of their distributions rjk = f'jf'k - Their mutual information 
is then /(r" ; r") — 0. 

7 Conclusions and future work 

When the Web is viewed as a global data store, the problem of its semantics is the problem of determining 
a uniform meaning for the data published by its various participants. The search engines are dealing with 

^ The theorem in the appendix suggests that they are similarly distributed, up to a scale factor. 
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this problem on the level of the human-Web interaction (e.g., distinguishing the meanings of the word 
"jaguar", sometimes denoting a car, sometimes an animal [12], or deciding whether "Paris Hilton", in a 
given context, refers to a person or to a hotel, etc.), whereas the Semantic Web project [T] deals with the 
computer- Web interactions. When the Web is viewed as a computer, the problem of its semantics is not just 
a matter of assigning some meanings to some data stored in it, but also to its data processing operations. For 
programming languages, this is what we usually call operational semantics. However, unlike a programming 
language, the Web, and other spontaneously evolving networks, do not have a formally defined set of data 
structures and operations: data are transformed by many random walks, running concurrently. Operational 
semantics of network computation requires a toolkit for incremental analysis of such processes. In this paper, 
we described a path ranking method, which is may become a useful piece of that toolkit. Now we sketch a way 
to test it experimentally. Using the notion of attraction bias, we lift the graph theoretic notion of (maximal) 
clique into rank analysis, while retaining network dynamics as a graph structure over such generalized cliques. 
We call these generalized cliques concepts and the links between them associations. 

Communities and concepts. Taking the notion of attraction bias back to the idea of communities as sets 
of nodes with high cohesion, from which we started in the Introduction, we now reformulate the notion of 
cohesion in a different norm (£^0 instead of £1), and define cohesion of a set of nodes [/ C to be their 
minimal symmetric attraction bias 

nu)= A m.-vT,o 

For each e G [0, 1], we define an e- community to be a set of nodes U C N such that T{U) > e. Denoting 
by pgN the set of e-communities, note that ei < implies that pe^N 3 p^^N. The partial ordering of 
U,V € peN is given by U ^ V <;=^ U C V A T{U) < T{V) This gives a directed complete partial order 
(dcpo). It is not a lattice because some communities cannot be extended by new nodes without decreasing 
their cohesion; so there are pairs of communities that cannot be joined, and do not have an upper bound. 
However, directed sets of communities (i.e., where each pair has an upper bound) do have least upper bounds, 
which are just their set theoretic unions. Directed complete partial orders are often used in denotational 
semantics of programming languages ,5 . According to that interpretation, communities can be thought of as 
pieces of partial information, their C- ordering as the increase of information, and the existence of an upper 
bound of two communities as the consistency of the informations that they carry. 

The maximal elements of peN , i.e. the communities that cannot be extended by new nodes without losing 
cohesion, can be construed as e-concepts. A set [/ g p^N is thus an e-concept if T({z,j}) > e holds for all 
i, j G U , but for every k ^ N \ U there is a j G ?7 such that T {{k,j}) < e. 

The community and concept structure of a network A can be analyzed by studying the sequence of 
hypergraphs A^, where the e-concepts, or the e-communities approximating them, are viewed as hyperedges. 
The sequence (A£)eg[o,i] decreases as the cohesion parameter e increases, and the highly cohesive communities 
and concepts can be feasibly analyzed. 

A level further, concepts and communities can be viewed as the nodes of a network. The most interesting 
definition of the links between them, intuitively thought of as associations, is based on a variant of a path 
network, complementing definition [2l A sketch of this definition is in the next, final subsection. 

Associations. Let 7V^ denote the set of e-concepts in a network A. 

The concept network A^, induced by a network A, has the e-concepts as its nodes. Its edges are called 
concept associations. The set of associations between U,V G A/"^ is 

E E (13) 

u~*unv unv^v 

a b 

where U V abbreviates ^(^) G C/ A £i(^) G V , and 

Eab = {f = (/o, /i) G E,j X Em I 7(/o) + l{h) < w - d and 7(a) + 7(/i) <v~d] 
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An association / G Auv is thus a quadruple / = (a, h, fo, fi) 

. fo 

I ^ J 

a b 

k > £ 

ji 

such that i,j,k € U and j, A;,^ G V. Its cost is 7(/) = 7(/o) +7(6) — 7(a) —7(/i). The cost of an association 
from U to y is lower if the traffic from i G U to I GV gets less costly when it crosses to V earlier. 

While the general network analysis tools apply to concept networks, the various notions of dynamics 
acquire new meanings on this level. At this point, understanding which of the possible interpretations may 
lead to useful tools for extracting and analyzing the relevant concepts, processed in a network, seems to call 
for experimentation with real data. 
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Appendix: Proofs 

Proof (of lemma\T^). The first claim is that there is a sufficiently large v such that 7(/o) +7(/i) < 

V — 2d hods for all /o G Eij and /i G Since 6 and d are fixed, the claim is clear if Eij and Eki are finite. 
Since A is assumed to be truncated complete, an infinite set of paths can only be generated from the links 
with a cost < 0. So the costs of the elements of Eij and Eki are in any case bounded. 

But if ah /o e Ei^j and /i e E^ satisfy 7(/o) + 7(6) + 7(/i) < t; - 2d, then Eab = x Em- Unfolding 
the definition of Aab and using ^ we get 

^ ^ ^ 2-2d-7(/o)-7(fc)-7(/l)+7(<i) 

2-7(6) 
2-7(a) 



9-7(0) / ' / ' 



[TP follows directly from[Tll8|), unpacking = 2-'^('=). And[IlI10l) then follows fromigS]): 

Aa» — ^ ^ ^afc 
h 

= E E^- 



¥Aa 

Proof (of proposition[J^^). We unfold the definition of r and then use ^ and PU)) : 



ee|^- 

■' b 



4d\^ ^ij kAkl ^ 



\ - \ - AijAjkAke 
A,,A„A,i 



EAijAjkAkt v^-- 
A^,A„A,i ^''"^ 

= E A^Mke)ne 
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Proof (of corollary \l\mi\) ). We set qj — J^keN^jk and expand rjk using proposition [TJI7]) , to show that q is 
the stationary distribution of the process A" : 



1j = H ^J fc 



AijAjkAki - 
Ai,A,,A,( 



J2 -ir^^^^ni 

k,i,ieN 



A.4m A-Mu 

i,eeN * 

Since r" is by definition the stationary point of A" , the uniqueness imphes q = r" 
To prove [Tljl2p , we set qi^ — ^2jeN'''Jf' and proceed similarly: 

j,i,eeN 

EAij AjkAki ^ 
A„A„A,t'^'^ 

EA,kAke^ 
a::a7/'' 



Since r" is by definition the stationary point of A", the uniqueness implies q = r' 
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